智能论文笔记

PU GNN: Chargeback Fraud Detection in P2E MMORPGs via Graph Attention Networks with Imbalanced PU Labels

Jiho Choi , Junghoon Park , Woocheol Kim , Jin-Hyeok Park , Yumin Suh , Minchang Sung

分类：机器学习

2022-11-16

The recent advent of play-to-earn (P2E) systems in massively multiplayer online role-playing games (MMORPGs) has made in-game goods interchangeable with real-world values more than ever before. The goods in the P2E MMORPGs can be directly exchanged with cryptocurrencies such as Bitcoin, Ethereum, or Klaytn via blockchain networks. Unlike traditional in-game goods, once they had been written to the blockchains, P2E goods cannot be restored by the game operation teams even with chargeback fraud such as payment fraud, cancellation, or refund. To tackle the problem, we propose a novel chargeback fraud prediction method, PU GNN, which leverages graph attention networks with PU loss to capture both the players' in-game behavior with P2E token transaction patterns. With the adoption of modified GraphSMOTE, the proposed model handles the imbalanced distribution of labels in chargeback fraud datasets. The conducted experiments on two real-world P2E MMORPG datasets demonstrate that PU GNN achieves superior performances over previously suggested methods.

translated by 谷歌翻译

Data Leakage via Access Patterns of Sparse Features in Deep Learning-based Recommendation Systems

Hanieh Hashemi , Wenjie Xiong , Liu Ke , Kiwan Maeng , Murali Annavaram , G. Edward Suh , Hsien-Hsin S. Lee

分类：机器学习

2022-12-12

Online personalized recommendation services are generally hosted in the cloud where users query the cloud-based model to receive recommended input such as merchandise of interest or news feed. State-of-the-art recommendation models rely on sparse and dense features to represent users' profile information and the items they interact with. Although sparse features account for 99% of the total model size, there was not enough attention paid to the potential information leakage through sparse features. These sparse features are employed to track users' behavior, e.g., their click history, object interactions, etc., potentially carrying each user's private information. Sparse features are represented as learned embedding vectors that are stored in large tables, and personalized recommendation is performed by using a specific user's sparse feature to index through the tables. Even with recently-proposed methods that hides the computation happening in the cloud, an attacker in the cloud may be able to still track the access patterns to the embedding tables. This paper explores the private information that may be learned by tracking a recommendation model's sparse feature access patterns. We first characterize the types of attacks that can be carried out on sparse features in recommendation models in an untrusted cloud, followed by a demonstration of how each of these attacks leads to extracting users' private information or tracking users by their behavior over time.

translated by 谷歌翻译

Consecutive Knowledge Meta-Adaptation Learning for Unsupervised Medical Diagnosis

Yumin Zhang , Yawen Hou , Xiuyi Chen , Hongyuan Yu , Long Xia

分类：计算机视觉

2022-09-21

基于深度学习的计算机辅助诊断（CAD）在学术研究和临床应用中引起了吸引人的关注。然而，卷积神经网络（CNN）诊断系统严重依赖于标记的病变数据集，对数据分布变化的敏感性也限制了CNN在CAD中的潜在应用。开发了无监督的域适应性（UDA）方法来解决昂贵的注释和域间隙问题，并在医学图像分析中取得了巨大的成功。然而，现有的UDA方法仅适应从源病变域中汲取的知识到一个单个目标病变域，这是针对临床情况的：要诊断的新的未标记的目标域始终以在线和连续的方式到达。此外，由于新知识的知识覆盖了先前学到的知识（即灾难性的遗忘），因此现有方法的性能在先前学到的目标病变域上大大降低。为了处理上述问题，我们开发了一个名为连续病变知识元适应（CLKM）的元适应框架，该框架主要由语义适应阶段（SAP）和表示适应阶段（RAP）组成，以在线学习诊断模型和连续的方式。在SAP中，从源病变域中学到的语义知识转移到连续的靶病变域。在RAP中，优化了功能提取器以对齐整个源和多个目标病变域的可转移表示知识。

translated by 谷歌翻译

The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022

Sangwon Suh , Sunjong Park

分类：人工智能 | 机器学习

2022-09-21

在本文中，我们描述了RTZR团队Voxceleb扬声器识别挑战2022（VOXSRC-22）的最高得分提交，在封闭的数据集中，扬声器验证轨道1.最高执行的系统是7型型号的融合，其中包含3种不同类型的类型模型体系结构。我们专注于培训模型以学习周期性信息。因此，所有型号均以4-6秒的镜头训练，每次发言。此外，我们采用了较大的保证金微调策略，该策略在我们的某些融合模型的先前挑战上表现出良好的表现。在评估过程中，我们应用了具有自适应对称归一化（AS-NORM）和矩阵得分平均值（MSA）的评分方法。最后，我们将模型与逻辑回归混合在一起，以融合所有受过训练的模型。最终提交在VOXSRC22测试集上实现了0.165 DCF和2.912％EER。

translated by 谷歌翻译

Measuring and Controlling Split Layer Privacy Leakage Using Fisher Information

Kiwan Maeng , Chuan Guo , Sanjay Kariyappa , Edward Suh

分类：机器学习

2022-09-21

拆分学习和推理建议运行跨客户设备和云的大型模型的培训/推理。但是，这样的模型拆分引起了隐私问题，因为流过拆分层的激活可能会泄漏有关客户端私人输入数据的信息。当前，没有一个好方法可以量化通过分层泄漏多少私人信息，也没有一种将隐私提高到所需级别的好方法。在这项工作中，我们建议将Fisher信息用作隐私指标来衡量和控制信息泄漏。我们表明，Fisher信息可以直观地理解以无偏重建攻击者的限制的错误形式通过拆分层泄漏了多少私人信息。然后，我们提出了一种增强隐私的技术REFIL，可以在拆分层上强制使用用户呈现的Fisher信息泄漏，以实现高隐私，同时保持合理的实用程序。

translated by 谷歌翻译

TASKED: Transformer-based Adversarial learning for human activity recognition using wearable sensors via Self-KnowledgE Distillation

Sungho Suh , Vitor Fortes Rey , Paul Lukowicz

分类：计算机视觉 | 机器学习

2022-09-14

Wearable sensor-based human activity recognition (HAR) has emerged as a principal research area and is utilized in a variety of applications. Recently, deep learning-based methods have achieved significant improvement in the HAR field with the development of human-computer interaction applications. However, they are limited to operating in a local neighborhood in the process of a standard convolution neural network, and correlations between different sensors on body positions are ignored. In addition, they still face significant challenging problems with performance degradation due to large gaps in the distribution of training and test data, and behavioral differences between subjects. In this work, we propose a novel Transformer-based Adversarial learning framework for human activity recognition using wearable sensors via Self-KnowledgE Distillation (TASKED), that accounts for individual sensor orientations and spatial and temporal features. The proposed method is capable of learning cross-domain embedding feature representations from multiple subjects datasets using adversarial learning and the maximum mean discrepancy (MMD) regularization to align the data distribution over multiple domains. In the proposed method, we adopt the teacher-free self-knowledge distillation to improve the stability of the training procedure and the performance of human activity recognition. Experimental results show that TASKED not only outperforms state-of-the-art methods on the four real-world public HAR datasets (alone or combined) but also improves the subject generalization effectively.

translated by 谷歌翻译

Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning using Independent Component Analysis

Sanjay Kariyappa , Chuan Guo , Kiwan Maeng , Wenjie Xiong , G. Edward Suh , Moinuddin K Qureshi , Hsien-Hsin S. Lee

分类：机器学习 | 人工智能

2022-09-12

联合学习（FL）旨在对多个数据所有者持有的分布式数据执行隐私的机器学习。为此，FL要求数据所有者在本地执行培训，并与中央服务器共享梯度更新（而不是私人输入），然后将其安全地汇总在多个数据所有者上。尽管汇总本身并不能证明提供隐私保护，但先前的工作表明，如果批处理大小足够大，则足够了。在本文中，我们提出了鸡尾酒会攻击（CPA），与先前的信念相反，能够从汇总的渐变中恢复私人输入，这是批量较大的大小。 CPA利用了至关重要的见解，即来自完全连接的层的总梯度是其输入的线性组合，这使我们将梯度反演作为盲源分离（BSS）问题（非正式地称为鸡尾酒会问题）。我们适应独立的组件分析（ICA） - BSS问题的经典解决方案 - 恢复针对完全连接和卷积网络的私人输入，并表明CPA明显优于先前的梯度反转攻击，对成像网的输入量表，并表现出Imagenet大小的输入的范围最高可达1024的大批量。

translated by 谷歌翻译

Global Planning for Contact-Rich Manipulation via Local Smoothing of Quasi-dynamic Contact Models

Tao Pang , H. J. Terry Suh , Lujie Yang , Russ Tedrake

分类：机器人

2022-06-22

增强学习（RL）在接触式操纵中的经验成功（RL）从基于模型的角度来理解了很多待理解，其中关键困难通常归因于（i）触点模式的爆炸，（ii）僵硬，非平滑接触动力学和由此产生的爆炸 /不连续梯度，以及（iii）计划问题的非转换性。 RL的随机性质通过有效采样和平均接触模式来解决（i）和（ii）。另一方面，基于模型的方法通过分析平滑接触动力学来解决相同的挑战。我们的第一个贡献是建立两种方法的简单系统方法的理论等效性，并在许多复杂示例上提供定性和经验的等效性。为了进一步减轻（II），我们的第二个贡献是凸面的凸面，可区分和准动力的触点动力学表述，这两个方案都可以平滑方案，并且通过实验证明了对接触富含接触的计划非常有效。我们的最终贡献解决了（III），在其中我们表明，当通过平滑度抽取接触模式时，基于经典的运动计划算法在全球计划中可以有效。将我们的方法应用于具有挑战性的接触式操纵任务的集合中，我们证明了基于模型的有效运动计划可以实现与RL相当的结果，而计算却大大较少。视频：https：//youtu.be/12ew4xc-vwa

translated by 谷歌翻译

Do Differentiable Simulators Give Better Policy Gradients?

H. J. Terry Suh , Max Simchowitz , Kaiqing Zhang , Russ Tedrake

分类：机器学习 | 人工智能 | 机器人

2022-02-02

通过基于一阶梯度的估计，通过替换零阶梯度估计来替换零阶梯度估计，可以通过估算零阶梯度估计来更快地计算时间。但是，尚不清楚哪些因素决定了两个估计量在复杂景观上的性能，尽管该问题对于可区分的模拟器的实用性至关重要，但涉及长途计划和对物理系统的控制。我们表明，某些物理系统的特征，例如刚度或不连续性，可能会损害一阶估计器的功效，并通过偏置和方差的镜头分析这种现象。我们还提出了一个$ \ alpha $ - 订单梯度估计器，并在[0,1] $中使用$ \ alpha \，它正确利用了精确的梯度将一阶估计值的效率与零级方法的鲁棒性结合在一起。我们在一些数值示例中证明了传统估计器的陷阱以及$ \ alpha $订单估计器的优势。

translated by 谷歌翻译

Explainable deep learning for insights in El Nino and river flows

Yumin Liu , Kate Duffy , Jennifer G. Dy , Auroop R. Ganguly

分类：机器学习

2022-01-07

El Nino Southern振荡（ENSO）是热带中央和东太平洋的海面温度（SST）的半周期波动，通过远程依赖或电信连接，影响世界各地的区域水文中的际变化。最近的研究表明了改进ENSO预测以及用于了解电信连接的复杂网络（CN）的深度学习（DL）方法的价值。然而，预测对Enso驱动的河流流动的差距包括DL的黑匣子性质，使用简单的ENSO指数来描述复杂的现象并将基于DL的ENSO预测翻译成河流预测。在这里，我们显示可解释的DL（XDL）方法，基于显着性图，可以提取全球SST中包含的可解释的预测信息，并发现对河流的新型SST信息区域和依赖结构，这些信息与气候网络结构串联，使得改进的预测性理解。我们的结果揭示了全球SST超越ENSO指数的更多信息内容，开发了对SSTS影响河流的新了解，并产生了与不确定性的改进的河流预测。观察，重新分析数据和地球系统模型模拟用于展示基于XDL-CN基于互际和分支尺度的气候预测的方法的价值。

translated by 谷歌翻译